High Throughput and Cost Efficient VLSI Architecture of Integer Motion Estimation for H.264/AVC
نویسندگان
چکیده
Variable block size motion estimation (VBS-ME) is one of the contributors to H.264/AVC’s excellent coding efficiency. Due to its high computational complexity, however, VBS-ME needs acceleration for real-time high-resolution applications. This paper proposes a high throughput and cost efficient VLSI architecture for integer full-search VBS-ME in H.264/AVC. A new scan order is introduced to re-use the reference data, 16 4x4 SAD generators in parallel can produce 16 4x4 SADs at the same time, an adder tree unit can produce larger block’s SAD by using these small 4x4SADs. Integer-pixel ME module produces two groups of 41 integer motion vectors (IMVs) synchronously by checking two candidate points in parallel. After logic synthesis using SMIC 0.13 μm standard cell library, under a clock frequency of 300MHz, the integer ME architecture allows the processing of 1280x720(720HD) at 38fps with full search block matching algorithm(FS-BMA) in a search range [-32, +32].
منابع مشابه
An Efficient VLSI Architecture of Fractional Motion Estimation in H.264 for HDTV
Fractional Motion Estimation (FME) in highdefinition H.264 presents a significant design challenge in terms of memory bandwidth, latency and area cost as there are various modes and complex mode decision flow, which require over 45% of the computation complexity in the H.264 encoding process. In this paper, a new highperformance VLSI architecture for Fractional Motion Estimation (FME) in H.264/...
متن کاملHigh Performance VLSI Architecture of Fractional Motion Estimation for H.264/AVC
Due to its high computational complexity, fractional motion estimation (FME) needs acceleration for real-time high-resolution applications. This paper proposes a high performance and cost efficient VLSI architecture for full-search FME in H.264/AVC. For half-pixel ME, 112 4×4 processing elements are processing in parallel, the included half-pixel interpolator is based on 4×4 block, and is reali...
متن کاملAn Efficient VLSI Computation Reduction Scheme in H.264/AVC Motion Estimation
The variable block sizes motion estimation in H.264 is key technique to remove inter-frame redundancy. This technique not only requires huge memory bandwidth but also its computation complexity is higher. Therefore, this paper proposes one efficient sub-pixel search algorithm for reducing computation complexity and bandwidth utilization, and a novel VLSI architecture for this algorithm which si...
متن کاملAn efficient VLSI processor chip for variable block size integer motion estimation in H.264/AVC
Motion estimation (ME) is the most critical component of a video coding standard. H.264/AVC adopts the variable block size motion estimation (VBSME) to obtain excellent coding efficiency, but the high computational complexity makes design difficult. This paper presents an effective processor chip for integer motion estimation (IME) in H264/AVC based on the full-search block-matching algorithm (...
متن کاملEfficient Architecture for Variable Block Size Motion Estimation in H.264/AVC
This paper proposes an efficient VLSI architecture for the implementation of variable block size motion estimation (VBSME). To improve the performance video compression the Variable Block Size Motion Estimation (VBSME) is the critical path. Variable Block Size Motion Estimation feature has been introduced in to the H.264/AVC. This feature induces significant complexities into the design of the ...
متن کامل